Abstraction of High Level Concepts from Numerical Values in Databases
Authors
Wesley W. Chu and Kuorong Chiang, Computer Science Department, University of California, Los Angeles
Abstract
A conceptual clustering method is proposed for discovering high level concepts of numerical attribute values from databases. The method considers both the frequency and the value distributions of the data, and is thus able to discover relevant concepts from numerical attributes. The discovered knowledge can be used for representing data semantically and for providing approximate answers when exact ones are not available. Our knowledge discovery approach is to partition the data set of one or more attributes into clusters that minimize the relaxation error. An algorithm is developed which finds the best binary partition in O(n) time and generates a concept hierarchy in O(n^2) time, where n is the number of distinct values of the attribute. The effectiveness of our clustering method is demonstrated by applying it to a large transportation database for approximate query answering.
Similar resources
Using Concept Hierarchies in Knowledge Discovery
In Data Mining, one of the steps of the Knowledge Discovery in Databases (KDD) process, the use of concept hierarchies as background knowledge allows the discovered knowledge to be expressed at a higher abstraction level, more concisely and usually in a more interesting format. However, data mining for high level concepts is more complex because the search space is generally too big. Some data minin...
Ontology-based Induction of High Level Classification Rules
A tool that could be beneficial to the data mining community is one that facilitates the seamless integration of knowledge bases and databases. This kind of tool could form the foundation of a data mining system capable of finding interesting information using ontologies. In this paper, we describe a new algorithm based on the query facilities provided by such a tool, ParkaDB which is a knowledge ...
Integrating ontologies and schema for biographic and geographic databases
This paper describes a key set of ingredients to sharing biographical and geographic information that is stored in separate databases. These ingredients include the concepts of geospatial ontologies as well as database schema. A proof-of-concept system was developed with three databases of Chinese history and geography. Background and Relevance Ontologies have been a dominant theme in GIScience...
Emergence of Semantic Concepts in Visual Databases
Content-based image retrieval (CBIR) systems can also be used for purposes other than online access to unannotated image databases. In particular, when a CBIR system is equipped with an automatic image segmentation subsystem, keyword annotations given on image level can be focused on specific image segments. In this paper, we show that our PicSOM CBIR system is able to reveal semantic knowledge...
Publication date: 1994